CLOUD BASED MULTI-LANGUAGE INDEXING USING CROSS LINGUAL INFORMATION RETRIEVAL APPROACHES
نویسندگان
چکیده
The exponential growth of data sizes created by digital media (video/audio/images), physicalsimulations, scientific instruments and web authoring joins the new interest in cloud computing. options for distribution parallelization information clouds make retrieval storage processes very complicated, especially when faced with real-time management. quantity Web Users getting access to over Internet is expanding step step. An enormous measure on accessible various languages which could be accessed anyone whenever. Information Retrieval (IR) manages finding valuable from a huge assortment unorganized, organized semi-organized information. In present situation, variety language boundaries are difficult challenges communication social trade world. To tackle such obstructions, CLIR, cross-language frameworks, these days solid interest. Query Expansion (Q.E.) way toward adding related important terms original inquiry upgrade its indexing ability improve significance recovered files CLIR. this exploration work, Q.E. has been investigated Hindi-English Kannada-English CLIR that Hindi Kannada queries utilized look through English docs. After interpretation query, outcomes positioned making use OkapiBM25 organize most doc at top docs using QE. We proposed architecture importance reports. primary investigation, performed without ranking. show pertinence archives higher OKapiBM25 as contrast one positioning. work plainly demonstrate presentation framework can improved altogether query development fitting located suitable place Snippets incredibly fill continuous test collection.
منابع مشابه
Automatic Cross-Language Information Retrieval using Latent Semantic Indexing
We describe a method for fully automated cross-language document retrieval in which no query translation is required. Queries in one language can retrieve documents in other languages (as well as the original language). This is accomplished by a method that automatically constructs a multi-lingual semantic space using Latent Semantic Indexing (LSI). We present strong preliminary test results fo...
متن کاملVector-based Semantic Analysis using Random Indexing and Morphological Analysis for Cross-Lingual Information Retrieval
Meaning, the arguably most important theoretical object of study in information access, is most decidedly situationdependent. While much of meaning appears to achieve consistency across usage situations — a term will seem to mean much the same thing in many of its contexts — most everything can be negotiated on the go. Human processing appears to be flexible in this respect, and oriented toward...
متن کاملCross lingual Information Retrieval
A large amount of information in the form of text, audio, video and other documents is available on the web. Users should be able to find relevant information in these documents. Information Retrieval (IR) refers to the task of searching relevant documents and information from the contents of a data set such as the World Wide Web (WWW). A web search engine is an IR system that is designed to se...
متن کاملTranslation-Based Indexing for Cross-Language Retrieval
Structured queries have proven to be an effective technique for crosslanguage information retrieval when evidence about translation probability is not available. Query execution time is adversely impacted, however, because the full postings list for each translation is used in the computation. This paper describes an alternative approach, translation-based indexing, that improves query-time eff...
متن کاملDifferent approaches to Cross Language Information Retrieval
This paper describes two experiments in the domain of Cross Language Information Retrieval. Our basic approach is to translate queries word by word using machine readable dictionaries. The first experiment compared different strategies to deal with word sense ambiguity: i) keeping all translations and integrate translation probabilities in the model, ii) a single translation is selected on the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information Technology in Industry
سال: 2021
ISSN: ['2204-0595', '2203-1731']
DOI: https://doi.org/10.17762/itii.v9i1.269